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Abstract — A polar coding scheme for fading channels is pro- 
posed In this paper. More specifically, the focus is Gaussian fading 
channel with a BPSK modulation technique, where the equivalent 
channel could be modeled as a binary symmetric channel with 
varying cross-over probabilities. To deal with variable channel 
states, a coding scheme of hierarchically utilizing polar codes is 
proposed. In particular, by observing the polarization of different 
binary symmetric channels over different fading blocks, each 
channel use corresponding to a different polarization is modeled 
as a binary erasure channel such that polar codes could be 
adopted to encode over blocks. It is shown that the proposed 
coding scheme, without instantaneous channel state information 
at the transmitter, achieves the capacity of the corresponding 
fading binary symmetric channel, which is constructed from 
the underlying fading AWGN channel through the modulation 
scheme. 

I. Introduction 

Polar codes are the first family of provably capacity achiev- 
ing codes for arbitrary symmetric binary-input discrete mem- 
oryless channels (B-DMC) with low encoding and decoding 
complexity IT] IS). Channel polarization has then been gener- 
alized to arbitrary discrete memoryless channels with the same 
order of construction complexity and error probability behav- 
ior ||3|. Moreover, polar codes are also proved to be optimal 
for lossy compression with respect to binary symmetric source 
lIUISl, and then further extended to larger source alphabet [6J. 

Polar codes also contribute significantly to non-discrete 
input channels. By adopting polar codes as embedded codes 
at each expanded level, expansion coding scheme |7| achieves 
the capacity of additive exponential noise channel in high 
SNR region with low coding complexity. Besides, a polar 
coding scheme achieving capacity for additive Gaussian noise 
channel is investigated in IS], which utilizes the polarization 
result for multiple access channel ||9l. It has been shown 
that the approach of using a multiple access channel with a 
large number of binary-input users has much better complexity 
attributes than the one of using a single-user channel with large 
input cardinality. 

In this paper, we investigate the polar coding scheme for 
binary-input AWGN channel ifTOlllfTTl in a fading scheme. 
By adopting BPSK modulation and demodulation technique, 
additive Gaussian noise fading channel has been boiled down 
to a binary symmetric channel (BSC) with finite set of tran- 
sition probabilities according to the channel quality. The key 
intuition of the proposed scheme is based on observing the po- 
larization characteristics of different BSCs. By hierarchically 



using polar codes, where the transmitter encodes over blocks, 
it can be proved that the designed coding scheme achieves the 
capacity of converted channel (fading BSC). 

The rest of paper is organized as follows. After introducing 
the preliminary results on polar codes and problem background 
in Section II and III, respectively, the polar coding scheme for 
fading channels is stated and illustrated in Section IV. The 
paper concludes with a discussion section. 

II. Preliminary for Polar Codes 

The construction of polar code is based on the observa- 
tion of channel polarization. Consider a binary-input discrete 
memoryless channel W : X ^ y, where X = {0, 1}. Define 



F = 



1 
1 1 



Let Bn be the bit-reversal operator defined in [[T], where N = 
2". By applying the transform Gn = BnF'»"^ (F^"" denotes 
the n* Kronecker power of F) to ui-^n, consider transmitting 
the encoded output xi-^ through N independent copies of W. 
Then N new binary-input coordinate channels W^ : X — > 
y^ X X^^^ are constructed, where for each i e {!,..., N} 
the transition probability is given by 



wl^\yi:N,ui,i. 



"i + l:JV 



1 



M^'Myi:iv|ui:JvGjv). 



Then as N tends to infinity, the channels {W]^ } polarize to 
either noiseless or pure-noisy, and the fraction of noiseless 
channels is close to I{W), the symmetric mutual information 
of channel W Q. 

To this end, polar codes can be considered as GAr-coset 
codes with parameter {N,K,A,ua':), where u^c g X'^^^ 
is frozen vector (can be set to all-zero vector for symmetric 
channel |'1|), and the information set A is chosen as a K- 
element subset of {!,..., iV} such that the Bhattacharyya 
parameters satisfies Z(WJ^^) < Z(VK^^) for all i e A and 
J e A-. 

The decoder in polar coding scheme is successive cancela- 
tion (SC) decoder, which gives an estimate ui;n of Ui-jy given 
knowledge of A, u^c, and yi-N by computing 



0, 

di{yi:N,Ul:l^ 



if i e y^S 



in the order i from 1 to N, where 



I{W^ 



(»(i))N 



diiyi:N,Ul:i- 




JJ W^^>(yi:N,Ul:.-l\0) > ^ 

Otherwise. 



It has been proved that by adopting SC decoder, polar codes 
achieves any rate R < I{W) with decoding error scaHng 
as 0{2~^ ), where j3 < 1/2. Moreover, the encoding and 
decoding complexity of polar codes are both 0{N log N). 

III. Problem Background 

Fading channels characterize the wireless communication 
channels, where the channel states are changing over times 
and only available at the decoders. Fading coefficient typically 
varies much slower than transmission symbol duration in 
practice. To this end, a block fading model l,12J is proposed, 
whereby the state is assumed to be a constant over coherence 
time intervals and stationary ergodic across fading blocks. 

Consider the AWGN fading channel. 



yb,i — Hi,iXi,i + Z, 



bA : 



1, 



,B, 



1, 



,N, (1) 



where Z^i is i.i.d. additive Gaussian noise with variance Ez', 
Xi,^i is channel input with power constraint 



1 



B N 
6=1 i=l 



'bA 



< E 



X, 



Hi,i is channel gain random variable; N is blocklength; and 
B is number of blocks. For this moment, Hbi are assumed to 
be constant within a block and follow an i.i.d. fading process 
over blocks. In particular, for the two sates case {hi, h2] we 
consider, omitting the indices, the distribution of H is given 
by Pr{iJ = hi} = qi and Pr{iJ = /la} = ga = 1 - qi- 

Using BPSK modulation, any codeword produced by en- 
coder is mapped to signal with element in {—\/Elx, +\/Ex}- 
After utilizing a BPSK demodulation at the decoder, the 
equivalent channel can be formulated as a binary symmetric 
channel, with transition probability relating to channel states. 
More specifically, the converted channel is given by 



n.. = Xb^, ® Zf,,,;, 6 = 1, . . . , B, i = 1, 



,N, (2) 



where Xi,,i and Yf, ^ are both Bernoulli random variables repre- 
senting channel input and output correspondingly; Zf, ^ is i.i.d. 
channel noise, also distributed as Bernoulli random variable, 
but related to channel state. More precisely, if Hjyi — hi, then 

Pr{Zb,, = !} = !- ^hiVsm) = pi, (3) 

and if Hi, i = /12, then 

Pr{Zb,, = !} = !- ^h^VSNR) ^ p^, (4) 

where <&(•) is CDF of normal distribution and SNR = 
Ex/Ez- In other words, the channel can be modeled as 
Wi =BSC(pi) with probabiHty qi, and as W2 =BSC(p2) 
with probability 52- 

The ergodic capacity of the converted channel (fading BSC) 
is given by fl2) 

Csi-D = gi[l - H{pi)] +q2[l- H{p2)], (5) 
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Fig. 1: Illustration of polarizations for two BSCs. Values of 
I{W^ ), the reordered mutual information, are shown for 
both polarizations. The blue-solid plot represents the channel 
with higher transition probability pi, and the red-dashed for 
P2- Three index categories are denoted by Q, M., and B in 
order. 



where H{-) is the binary entropy function, and SI-D refers 
to channel state information at the decoder. The capacity 
achieving input distribution is uniform over {0, 1}. In this 
paper, we show a polar coding scheme achieving the capacity 
of converted fading channel with low encoding and decoding 
complexity, without having instantaneous channel state infor- 
mation at the transmitter (only the statistical knowledge is 
assumed). 

IV. Polar Coding for Fading Channel 
A. Intuition 

In polar coding for a general B-DMC W, we have seen the 
channel can be polarized by transforming a set of independent 
copies of given channels into a new set of channels whose 
symmetric capacities tend to or 1 for all but a vanishing frac- 
tion of indices. To this end, an information set A is constructed 
by picking the indices corresponding to K minimum values of 
Z{W^ ), which is equivalent to picking those corresponding 
to K largest values of /(W^ ). In this sense, the construction 
of A is deterministic. However, as indicated in IT], the indices 
in A are not adjacent. For this, we introduce a permutation 
TT : {!,..., N} -^ {!...., N}, which reorders all the indices 
by the value of I{W]^ ) ranging from high to low. Note that 
the construction of polar codes already implies the fact that 
for channels of the same type, their permutation mappings are 
the same. 

Another fact about polar codes is that the polarization is 
uniform ifTSl . Consider polarizing two B-DMCs, for instance 
BSCs with parameters pi and p2 respectively, then the infor- 
mation sets, denoted by Ai and A2, satisfy 



Ai C A2 if Pi > P2- 



(6) 



In other words, if a particular channel index constructed from 
the worse channel (BSC with larger transition probability) 
polarizes to be noiseless, so does that of the better channel 
(BSC with smaller transition probability). Based on this 
observation, when polarizing two Wi and W2 with transition 
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Fig. 2: Illustration of polar encoder for fading channel with two states. In this numerical example, we assume N = 16, 
B = 8, \A\ — 5, 1^1 — 7, \Ai\ — 5, and \B\ = 4. Bits in blue are information bits, and ones in white are frozen to zeros. After 
encoding of Phase 1, the codewords are rotated and embed to the messages of Phase 2 to generate the finalized codeword. 



probabilities defined by (J3]l and (Hll respectively, the indices 
after permutation tt can be divided into three categories 
(illustrated in Fig. [T]). Without loss of generality, we assume 

Pi >P2- 

1) Q: both channels are good, i.e. 

2) Ai: only channel 2 is good, while channel 1 is bad, i.e. 

3) B: both channels are bad, i.e. 

i{w^f^^) ^ 0, Hw^"^'^^) 



0. 



Denote the information sets for two channels as Ai and A2 
correspondingly, then obviously Ai = Q, and A2 = G ^ -M- 
Moreover, we have: 

\g\^\Ai\=N[l-H{p,)-e], (7) 

1^1 - 1^2! - l^il = N[H{pi) - H{p2)], (8) 

\B\=N-\A2\^N[H{p2)+ei (9) 

where e is a arbitrary small positive number. 

For the fading channel, we consider the transmitter has 
no prior knowledge of channel states before transmitting, 
hence, coding over channels with indices in Ai is challenging. 
Observe that for those channels, with probability 52 they 
are nearly noiseless, and with probability qi they are purely 
noisy. To this end, each channel can be modeled as a binary 
erasure channel (BEC) from the viewpoint of blocks, and we 
denote this channel as W. This intuition inspires our design 
of encoder and decoder for fading channels. 

B. Encoder 

The encoding process of polar coding for fading channel 
has two phases, hierarchically using polar codes to generate 
A^B-length codewords, where N is blocklength and B is the 
number of blocks. 



]) Phase!: Consider a set of S-length block messages 'y'^'^^ 
with k £ {!,..., |A^|}. For every u^'^^ construct polar code 
yC^), which is G^-coset code with parameter {B, \A\,A,0), 
where A is the information set for W = BEC(gi), and we 
choose 

\A\ = {l-qi-e)B. (10) 

In other words, we construct a set of polar codes, where each 
code corresponds to an index in set Ai, with the same rate 1 — 
qi — e, the same information set A, and the same frozen values 
as well. Mathematically, if denote the reordering permutation 
for W as TT, then 



tt{v 



(fe)^ 



„(fe) 



^(fc) 



') = K■^...,^^[^;,o,...,o], (11) 

= v'^''^Gb. (12) 

2) Phase 2: Consider another set of A^-length messages 
w'^'') with I e {l,...,B}. For every u'-''\ construct po- 
lar code x^''-', which is Gjv-coset codes with parameter 
(N, \g\,g,Ugl)), where Q is BSC information set with size 
given by (|7|. Remarkably, we do not froze all non-information 
bits to be 0, but embed the blockwise codewords from Phase 
1. More precisely, if denote the permutation operator of BSC 
as TT, then 



tt{u^'''>) = 



,(&) 



.,U 



(fc) ~(1) 



.,4'-^",o,. 



,,0] 



(13) 
u'^'^'^Gn- (14) 

By collecting all {x^^'^i-b together, the encoder generates and 
outputs a codeword with length NB. An example to illustrate 
this encoding process is shown in Fig. l2j 

C. Decoder 

After receiving the sequence yi;NB from channel, the 
decoder's task is trying to make estimates {v'^''^}i;\m\ ^i^d 
{u'^''^}i:B, such that the information bits in both sets of 
messages match the ones at the transmitter end with high 
probability. Rewrite channel output yi;NB as a B x N matrix, 
with row vectors {y^^^}i:B- As that of the encoding process, 
the decoding process also has two phases: 
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Fig. 3; Illustration of polar decoder for fading channel with two states. We use the same parameters as encoder. After 
Phase 1, decoder outputs all estimates {m^'^}i:b using BSC SC decoder based on channel states, then selected columns of 
decoded results are rotated and delivered as inputs to Phase 2. In the next phase, the decoder basically uses BEC SC decoder 
to decode v'^^^ from iS^^ for every k e {1, . . . , |A^|}. Bits in shade represent for erasures. 



1} Phase 1: For every 6 G {1, . . . , B}, decode u*^''-' from 
y'^^^ using SC decoder. More precisely, because at the receiver 
end, channel state is available, then receiver can adopt the 
corresponding SC decoder for BSC based on the channel 
state observed. Remarkably, for index in M., we decode as 
erasure, denoted as "e", for bad channel state. To this end, 
polar decoder is given by: if the channel state is hi, then use 
Decoder 7, otherwise use Decoder 2, where the two decoders 
are expressed follows: 
— Decoder 1: 



(6) A 



Ul = 




if & e S, 
if 6eA^, 



(fc) 



in the order i from 1 to N, where 



Decoder 2: 

Ab) A 



411) = 






W} 



,(!/("', «i?i_ ill) 



Otherwise. 



0, 






ifbeB, 

if beg UM, 



in the order i from 1 to N, where 



d^M'l^ii) 



if 



w. 



otherwise. 



>1, 



After decoding from y''') block by block, the decoder output 
a B X N matrix U with rows {u'^'^^i-.b- 

2) Phase 2: Select columns of U with indices in A4 after 
permutation tt to construct a Bx \A4 \ matrix U. Consider each 
column of U, denoted by m'*^) for k e {1, . . . , |A^|}, as the 
input to decoder in Phase 2. Then receiver aims to decode v'^''^ 
from u''^^ using SC decoder with respect to W =BEC(gi). 



More formally, the decoder in Phase 2 is expressed as follow: 
— Decoder 3: 



.(fc) A 



0, 



if fc e A", 
if fc e ^, 



in the order j from 1 to B, where 



d,(i^'Kv^U^ 



n if " *• ' i^3-ii^^ > 1 

1, Otherwise. 



After Phase 2, the decoder output a \A4\ x B matrix V with 
rows {v^''^i:\M\- An example to illustrate the decoding of 
both phases is shown in Fig. l3] 

D. Achievable Rate 

We want to show the rate in proposed polar coding scheme 
achieves the capacity of converted fading channel given by 
pll. Intuitively, by using BSC SC decoders corresponding to 
channel states, the output from Phase 1 successfully recovers 
all information bits in {u*^''''}i:B. Moreover, for those with in- 
dices corresponding to A4, the decoder could decode correctly 
if channel state is h2, and set to erasures otherwise. Thus, the 
input to decoding Phase 2, vector u'-'^' can be considered as an 
output of BEC((7i), hence BEC SC decoder could decode all 
information bits in w''''' correctly for any k e {1, . . . , |A^|}. 

More formally, we have the following theorem. 

Theorem 1. The proposed polar coding scheme achieves any 
rate R < Csid with arbitrarily small error probability for 
sufficiently large N and B. 

Proof: The proof is straightforward by utilizing error 
bound from polar coding. In Phase 1 of decoding, the er- 
ror probability of recovering u^''' correctly for each b £ 
{1, . . . ,B} is given by 



P. 



(b) 



l,e 



0(2-^" ), 



(15) 



where /3 < 1/2. Similarly, in decoding Phase 2, the error prob- 
ability of recovering v'^'^^ correctly for each k e {1, . . . , M} 
is given by 



3(fe) 

2,e 



0{2-^^). 



(16) 



Hence, by union bound, the total decoding error probability is 
upper bounded by 



P.< 



B \M\ 

2,e 
6=1 fe=l 



E^i^^i + E^i 



= 0(B2-^'') + 0(iV2-^ ) 

when A^ and B tend to infinity. In particular, we consider 
5 = 0(2^") and Ar = o(2^^). 

Moreover, from the analysis, it is evident that all messages 
bits in u*^'^) and u^^' are decodable, then the achievable rate 
for the designed scheme is given by 

R=^{\M\\A\ + B\g\] 

- ^{mH{p,) - H{p2)]B[l -qi-e] 

+ BN[l-H{pi)-e]} 
= q,[l-H{p,)]+q2[l-H{p2)]-die) 

where we have used (|7]l, ([8| and ([TO]l, and 

(5(e) = e[l + H{pi) - H{p2)] ^0, as e ^ 0. 
Thus, any rate R < Csid is achievable. ■ 

E. Complexity Analysis 

As we have seen, polar coding schemes for both BSC 
and BEC have relatively low complexity. Since the proposed 
polar coding scheme for fading channel hierarchically utilizes 
polar codes, the character of low complexity is consequently 
inherited. More precisely, \Ai\ number of i?-length polar 
codes as well as B number of A^-length polar codes are 
utilized. Hence, the overall complexity of the coding scheme, 
for both encoding and decoding, is given by 

\M\ ■ 0{B log B) + B ■ 0{N log N) = 0{NBlog{NB)). 

V. Discussion 

In this section, we generalize the polar coding scheme to 
fading channels with arbitrary finite number of states. Assume 
channel gain Hbi has S states from set {hi, . . . , hs}, where S 
is a positive integer Assume the distribution of Hi,j omitting 
indices is given by Pi{H — hs} = Qs, where s e {1, . . . , S"}. 
Then the converted channel using BPSK, defined in (|2]i, is still 
a BSC, whereas with probability g^, the transition probability 
is given by 



Denote the converted BSC corresponding to state hg as Wg, 
then the capacity of converted channel is given by 



s 



Csi.D^}_^qs[l-H{ps)], 

s=l 



(18) 



where 1 — H{ps) is the capacity of Wg- 

Observe that when polarizing S BSCs with different tran- 
sition probabilities, the indices could be divided into 5* + 1 
sets after permutation tt. More mixture sets M.i, . . . , M.s-i 
are defined in this case. Without loss of generality, we assume 
Pi>P2>--->Ps- Then \Ms\ = H{ps) - H{ps+i), and 
for index in set Ms, Wi, . . . , Wg are polarized to be purely 
noisy and all others to be noiseless. To this end, we consider 
a BEC with erasure probability Cs — J2t=i It ^'^ characterize 
the polarization result for index in A4. (See Fig. |4] for an 
intuition.) 
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Fig. 4: Illustration of polarizations for S BSCs. There are 

5—1 mixture sets, denoted as Aii, . . . ,Mg_i. 

Polar coding scheme designed for this channel is similar 
In Phase 1 of encoding, transmitter needs to generate 5—1 
sets of polar codes, where each one is G^-coset codes with 
parameter {B, \As\,As, 0) with respect to BEC(es), and all the 
encoded codewords are embed into messages for Phase 2. At 
the receiver end. Phase 1 should use one of 5 SC decoders for 
BSC to decode U(;), based on observation of channel states. 
Then in Phase 2, 5 — 1 BEC SC decoders are working in 
parallel to recover the information bits. By adopting this polar 
coding scheme, the achievable rate is given by 

2_lB\g\ + J2\Ms\\As\\ 

s-i 
= [1 - H{pi) - e] + J2[HiPs) - H{ps+i)]{l ~es~e) 

3 = 1 

s 
^Y.'isi^'H{ps)]~S'{e), 

s=l 

where S'{e) = e[l + H{pi) — H{ps)]- Thus, the proposed 
polar coding scheme achieves the capacity of channel, and the 
encoding and decoding complexities are both given by 

s-i 

Y^ \Ms\-0{BlogB)+B-0{NlogN) = 0{NBlog{NB)), 



R = 



s 
which is irrelative to the value of 5*, as ^ \-Ms\ < N. 



s=l 
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